Our domain of interest for this project is the COVID-19 pandemic. We are interested in analyzing data on COVID-19 because it is extremely relevant to our lives right now. The pandemic has affected all of us in one way or another. Therefore, we want to further examine and learn about the impact that the pandemic has had on our society.
The data set we are using for our project is the “AH Monthly provisional counts of deaths by age group, sex, and race/ethnicity for select causes of death” data set from the CDC, which provides a breakdown of COVID-19 deaths by different demographic groups (e.g., age group, race/ethnicity). The data was collected by the National Center for Health Statistics using death certificates of US residents from the start of 2019 until now. However, since the pandemic started in 2020, the total recorded COVID death in 2019 was 0, and the data set is not yet completed for 2021, we will only analyze the data pertaining to the year 2020.
According to the data, the total number of COVID deaths in 2020 was 731499, which is 22 percent of the total recorded deaths from all causes in 2020.The month that had the largest number of COVID death was 12, with a total number of 186129. Among all the age groups, population that are 85 years and over have the highest number of total COVID deaths. In addition, COVID deaths with underlying conditions takes up to 52 percent of the total COVID deaths. This correspond with the information provided by CDC, where elders and people with underlying medical conditions have a higher chance of death if getting infected. When we look into groups by race and ethnicity, data shows that Non-Hispanic White have the highest number of COVID deaths among all the other ethnicity groups, with a number of 439962 and takes up to 60 percent of the total COVID deaths in 2020. We can further explore the discrepancies of death rate among different groups from perspectives of sex, race/ethnicity and age.
The table below displays data regarding the general causes of death of various racial/ethnic groups for the year 2020. First, the table displays both the all-cause mortality rate and the death by natural cause rate (by racial/ethnic group). Additionally, the table also serves to convey the rate of deaths due to COVID-19, demonstrating the racial discrepancies in this rate. The table also displays the death rate when COVID-19 was a multiple cause of death, as well as the death rate when COVID-19 was an underlying cause of death. Overall, the purpose of this data is to convey the general/overall discrepancies among racial and ethnic groups in terms of causes of death, showing us what groups are potentially more prone to certain causes of death and allowing us to possibly think and acknowledge the societal implications of these differences.
| Race/Ethnicity | All Cause | Natural Cause | COVID-19 as a Multiple Cause of Death | COVID-19 as the Underlying Cause of Death |
|---|---|---|---|---|
| Hispanic | 307287 | 274342 | 69151 | 65377 |
| Non-Hispanic American Indian or Alaska Native | 24465 | 20862 | 4428 | 4058 |
| Non-Hispanic Asian | 91031 | 85807 | 13276 | 12452 |
| Non-Hispanic Black | 448770 | 401245 | 60940 | 56021 |
| Non-Hispanic White | 2483280 | 2294093 | 231618 | 208379 |
| Other | 29176 | 24837 | 3142 | 2877 |
The question of interest for this chart was how does the number of Covid deaths vary across age group. I chose to create a scatter plot encoding as it displays length to compare the amount of while also displaying the density at certain amounts of Covid deaths for the age groups. That way we can make observations on general spread trends across the data and spreads within particular age groups.
This chart illustrates that with higher age groups, the number of deaths caused by Covid increases. This can be attributed to the correlation between age and immunity. Younger people tend to have stronger immune systems than older people. Thus with a dangerous illness, those with worse immune systems are less likely to handle symptoms and survive the virus. Though the quantity of people per age group differed, the density and spread of the points per group clearly display the pattern of higher age groups having the highest Covid death rates.
Below is a pie chart, which shows the proportions of males vs. females who died due to COVID-19 and the total number of deaths that the proportions represent. We want to be able to answer the question of whether or not the numbers of COVID-19 deaths vary among different sex groups. Therefore, using a pie chart is appropriate because it allows for easy comparison between the two sex groups.
The pie chart reveals that there were 191,798 male individuals and 157,366 female individuals who died due to COVID-19 in the US in 2020. Approximately 54.9% of those who died due to COVID-19 were male and 45.1% were female. Although the difference between the percentages is not huge, there is certainly still a disproportion that shows that more men died with COVID-19 as an underlying disease than women. Overall, this implies that there may be an association between sex and COVID-19 susceptibility, possibly due to additional factors such as immunity, physiological, or lifestyle differences.
This chart helps us understand how the deaths due to COVID changed over the course of the pandemic, and the effect of that change on the death rate. Around April, the death rate spikes and then slowly reduces over the next few months. This is most likely due to the start of quarantine. After being in quarantine for a few months, the death rate starts to rise again, as shown by the bars from October to December, where the quarantine is either no longer useful, or no longer being strictly followed.